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ABSTRACT 

We extend the Fourier transform based method for the analysis of galaxy redshift surveys of 
Feldman, Kaiser & Peacock (1994: FKP) to model luminosity-dependent clustering. In a mag- 
nitude limited survey, galaxies at high redshift are more luminous on average than galaxies at 
low redshift. Galaxy clustering is observed to increase with luminosity, so the inferred den- 
sity field is effectively multiplied by an increasing function of radius. This has the potential 
to distort the shape of the recovered power spectrum. In this paper we present an extension 
of the FKP analysis method to incorporate this effect, and present revised optimal weights to 
maximize the precision of such an analysis. The method is tested and its accuracy assessed 
using mock catalogues of the 2-degree field galaxy redshift survey (2dFGRS). We also show 
that the systematic effect caused by ignoring luminosity-dependent bias was negligible for the 
initial analysis of the 2dFGRS of Percival et al. (2001). However, future surveys, sensitive to 
larger scales, or covering a wider range of galaxy luminosities will benefit from this refined 
method. 

Key words: cosmology: observations - large-scale structure of Universe - cosmological 
parameters - surveys 



1 INTRODUCTION 



The shape of the matter power spectrum as traced by large-scale 
structure is one of the cornerstones of modern cosmology. The 
shape is dependent on the primordial fluctuations, on the total mat- 
ter density Q. m through Q, m h which controls the matter-radiation 
horizon scale, and through the fraction of matter in baryons 
fli,/fi TO . Although these parameter combinations are nearly degen- 
erate, preliminary results from the 2-degree field galaxy redshift 
survey (2dFGRS) contained sufficient signal to weakly break this 
degeneracy (Percival et al. 2001: P01). The complete 2dFGRS, the 
Sloan Digital Sky Survey (SDSS; York et al. 2000) and forthcoming 
high-redshift surveys will refine these measures of the cosmologi- 
cal matter content from large-scale structure observations. 

High-quality measurements of the cosmic microwave back- 
ground (CMB) power spectrum (e.g. Bennett et al. 2003; Hinshaw 
et al. 2003) have brought about a new era of high-precision cosmol- 
ogy, in which the analysis of large-scale structure (LSS) data as- 
sumes even greater importance. Many of the degeneracies between 
cosmological parameters intrinsic to CMB data are broken by the 
addition of constraints from LSS data (e.g. Efstathiou et al. 2002; 
Percival et al. 2002; Spergel et al. 2003; Verde et al. 2003). Thus, 
the scientific potential of both data sets can be greatly enhanced 
through combination. However, the physics and instrumental ef- 
fects involved in the interpretation of LSS data are more compli- 
cated and less well understood than for the CMB data. To use LSS 



reliably in the context of high-precision cosmology, it is therefore 
important to consider the survey analysis method and any possible 
systematic problems in detail. 

There are a number of difficulties and potential systematic 
effects associated with recovering the shape of the matter power 
spectrum from that of the galaxies. Distances to galaxies are de- 
rived from redshifts, and peculiar velocities induce systematic dis- 
tortions in the recovered power. Although this effect is important 
and potentially carries cosmological information (e.g, Peacock et 
al. 2001 ; Hawkins et al. 2003; see Hamilton 1998 for a review), we 
will not address it here, but we will consider a more subtle effect. 
Galaxies are biased tracers of the matter distribution: the relation 
between the galaxy and mass density fields is probably both non- 
linear and stochastic to some extent (e.g. Dekel & Lahav 1999), so 
that the power spectra of galaxies and mass differ in general. If we 
define scale-dependent bias via 



P g (k)=b 2 (k)P m (k), 



(1) 



where subscripts m and g denote matter and galaxies respectively, 
there is no guarantee that b(k) will be a constant. The best we can 
hope for is that there will be a 'linear response' limit, in which b(k) 
tends to a constant bu n on large scales. Although it is easy to invent 
artificial models in which this is not true, the concept of linear bias 
does hold for many bias models - and in particular for the most de- 
tailed attempts to include all the physics of galaxy formation (e.g. 
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Benson et al. 2000). Such simulations can also give a realistic idea 
of the scales on which the linear-bias assumption breaks down, and 
we assume for the purposes of this paper that linear bias is true 
(or that small deviations from scale-independence can be corrected 
for). Nevertheless, even in the linear-bias limit, bias can complicate 
the analysis of LSS data, because the value of fei in will be different 
for different classes of galaxy. The purpose of this paper is to in- 
vestigate the extent to which this can affect the recovered power 
spectrum shape. 

The fact that galaxies selected in different ways have different 
clustering properties has been known for some time (e.g. Davis & 
Geller 1976; Peacock & Dodds 1994; Seaborne et al. 1999). For 
the 2dFGRS galaxies, although the average bias is close to unity 
(Lahav et al. 2002; Verde et al. 2002), the bias is dependent on 
galaxy luminosity (Norberg et al. 2001; 2002; Zehavi et al. 2002 
find a very similar dependence for SDSS galaxies), with 

where the bias b(L) is assumed to be a simple function of galaxy 
luminosity and L, is defined such that M bl — 51og 10 h = —19.7 
(Norberg et al. 2001). 

In a magnitude limited survey, galaxies of different luminos- 
ity are probed at different radii: at high redshift, galaxies more lu- 
minous than average (and with lower number density) dominate 
the sample, while at low redshift galaxies less luminous than av- 
erage (and with higher number density) dominate. As a conse- 
quence, the power spectrum at large scales is measured prefer- 
entially from galaxies more luminous than average, and on small 
scales from galaxies less luminous than average. Without correc- 
tion, luminosity-dependent bias would therefore distort the shape 
of the recovered galaxy power spectrum from that of the matter 
power spectrum (see Tegmark et al. 2003). 

In this paper we consider how best to estimate the shape of 
the underlying matter power spectrum given a sample of galaxies 
that have a clustering amplitude dependent on galaxy properties. 
Specifically, in Section 2 we consider a set of galaxies with a sim- 
ple luminosity-dependent bias such that, on the scales of interest 
P g (k) = b 2 (L)P m {k). For generality, the expected bias need not 
be simply a function of luminosity; all that is required is that the 
bias can be predicted from some combination of the properties of 
each galaxy. The results of this paper are therefore relevant to more 
general problems where we have a mixed set of galaxies tracing the 
same density field. For instance the method would be applicable 
to type-dependent clustering or a survey covering a large redshift 
range where the linear evolution of the matter power spectrum and 
evolution in the bias of the objects selected were important. 

Previous studies have already extended the calculation of op- 
timal weights presented by FKP. Hamilton (1997) considered the 
effect of the window function which the FKP method assumes to 
be negligible. More recently, Yamamoto (2003) attempted to de- 
rive optimal weights including the effect of both redshift-space 
distortions and the light-cone effect, which, as we will illustrate 
later, has a similar net effect as luminosity dependent bias. In this 
paper, the work of FKP is extended in a different context to the 
work of Yamamoto (2003), and we consider the systematic effect 
of luminosity-dependent bias on the recovered power in addition 
to providing optimal weights. Interestingly, our derived optimal 
weights, following an independent calculation, differ from those 
of Yamamoto (2003) and a comparison is given in Section 5. 

The layout of this paper is as follows. First, in Section 2.1 we 
present a generalization of the power spectrum analysis method in- 



troduced by Feldman, Kaiser & Peacock (1994: FKP), extended to 
correct for galaxy luminosity dependent bias. We then follow the 
FKP derivation leading to revised optimal weights in Section 2.3, 
in the limit of a large survey volume. The method and weights are 
tested in Section 3 using simple mock catalogues with a window 
function similar to that of the 2dFGRS, and the relevance for the 
analysis of P01 is considered in Section 4. We conclude in Sec- 
tion 5. 



2 ANALYSIS OF A MIXED LUMINOSITY SAMPLE 

In this Section we extend the method of FKP to cover a sample 
of galaxies with different large-scale biases. In order to make the 
description transparent, we adopt the notation of FKP, and refer 
extensively to this paper. In particular, we adopt the Fourier trans- 
form convention of FKP, equivalent to that of Peebles (1980) with 
V = l. 



2.1 Method 

Let us consider a set of galaxies of luminosity L forming a Poisson 
sampling of a linearly biased density field 

l + 6 g (r) = l + b(r,L)6(r). (3) 

Here b(r, L) is the linear bias of galaxies of luminosity L at po- 
sition r. We have generalized the idea of bias being a function of 
L only to allow for e.g. slow evolution of b(L) within the survey 
volume. The probability that volume element SV contains a galaxy 
of luminosity L is given by 

Prob(<5V,L) =6Vn(r,L)[l + b(r,L)6(r)], (4) 

where n(r,L) is the mean expected number density of galaxies at 
r with luminosity L. The power spectrum and correlation function 
form a Fourier pair with £(r) = £(r) = (S(r')5(r' + r)), and 

P(k) = P{k) = J d 3 r £(r)e lfc r . (5) 

Here and hereafter P(k) denotes the power spectrum of the 
underlying matter fluctuation field S. If, rather than considering the 
bias of each galaxy, we had instead considered the ratio of the bias 
to that of a particular galaxy type, then we simply have to redefine 
P(k) as the power spectrum of that galaxy type. Replacing b(r, L) 
by b(r, L)/b(r, L*) would mean that P(k) would have to be re- 
defined as the power spectrum of L, galaxies so, for the 2dFGRS 
galaxies, we could use Eq. 2 to recover the power spectrum of L, 
galaxies. 

Following the FKP approach, we also consider a synthetic cat- 
alogue with the same radial and angular sampling (selection func- 
tion), and luminosity distribution as the galaxy catalogue but with 
no correlations. This synthetic catalogue will be used (Eq. 6) to 
convert from a galaxy density field to a galaxy overdensity field. 
We will multiply objects in both the galaxy and synthetic catalogue 
by w(r, L)/b(r, L) - here the split distinguishes the bias correc- 
tion and additional weighting needed to optimize the precision of 
the power estimate. For clarity in the following text, we only use 
the "weight" to refer to w(r, L), not w(r, L)/b(r, L). 

The weighted galaxy fluctuation field is defined as 

F{r) = ± J dL^^[n g (r,L)-an s (r,L)] (6) 
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where n 9 (r, L) — ^\ 5(r — rj)S(L — Lj), with Tj being the 
location of the jth galaxy of luminosity Lj, and n s (r, L) is de- 
fined similarly for the synthetic catalogue. Here a is a constant that 
matches the two catalogues (see Section 2.4), and N is a normal- 
ization constant defined by 



P{k) = 



N 




dL n(r , L)w(r , L) 



1/2 



(7) 



Following appendix A of FKP, the two-point functions of n 9 , n s 
are 

(n g (r,L)n 3 (r',L')) = 

n(r, L)n(r', L?) [l + b(r, L)b(r' , L')£(r - r')] 
+n(r, L)5(r — r')5(L — L'), (8) 

(n s (r, L)n 3 (r', L')) = a~ 2 n(r, L)n(r' , L') 

+aT 1 n(r, L)5(r — r')5(L — L 1 ), (9) 

(n g {r, L)n s (r', L'j) = a _1 n(r, L)n{r' , L'), (10) 

where we have explicitly included the bias caused by the different 
galaxy luminosities. The expected value (F(r)F(r')) is therefore 
given by 



(F(r)F(r')) = G(r)G(r')£(r - r') + £shot(r, r'), 
where 



dLn(r, L)w(r, L), 



(11) 



(12) 



G(r) 
and 

C shot (r,r') = ^ J dLn(r,L)^^5(r-r'). (13) 

The first term in Eq. 11 corresponds to the multiplication of the 
correlation function by the window. The second is the shot noise 
term. Switching to Fourier space we have, as in FKP, that 

(\F(k)\ 2 )= J -0^P(k')\G(k-k')\ 2 +P shot . (14) 

The multiplication in real space (the first term in Eq. 11) has be- 
come a convolution in Fourier space (the first term in Eq. 14), but 
now 



G(fc) = 



and 



d 3 r I dLn(r, L)w(r, L)e^ tr , 



(15) 



(16) 



„ 1 + a f ,a f , r , r .w 2 (r. 

P shot = — j d A r J d Ln(r,L)^ 

Ignoring the bias changes between galaxies with different luminosi- 
ties reduces G(k) and P s hot to the FKP expressions. 

The factorization of the multiplicative factor in Eq. 6 is now 
clear: although we multiply by w(r , L) /b(r, L), the window func- 
tion shape (Eq. 15) and normalization (Eq. 7) are only dependent 
on the weights w(r, L): the additional l/b(r, L) factor simply cor- 
rects the measured galaxy fluctuations to an estimate of the matter 
fluctuations, and does not affect the window. 

Our estimator of P(k) convolved with the window function 
is given by P(k) — |F(fe)| 2 — P s hot- Averaging over a shell in 
fc-space, gives our final estimator of the convolved power P{k), 
P(k), 



V k 



d z k'P{k'), 



(17) 



where Vk is the volume of the shell. Ignoring redshift-space 
distortions, P(k) is the true power spectrum convolved with a 
spherically-averaged window, which can be computed by spher- 
ically averaging |G(fc)| 2 , with G(k) given by Eq. 15. Redshift- 
space distortions mean that the interpretation of Eq. 17 is actually 
more complicated, and P(k) depends on the full survey window. In 
the spirit of generalizing FKP, such effects are not included here; a 
separate study of this issue would nevertheless be of interest. 

We can now contrast the FKP procedure with the exact analy- 
sis of luminosity-dependent clustering, as expressed in Eqns. 14 
- 16. FKP used luminosity-independent weights, equivalent to 
w(r,L) = b(r, L) w F Kp(r) in our current notation. The result- 
ing density fluctuation field (Eq. 6) differs from the correspond- 
ing quantity in FKP, because the normalization factor, N, contains 
w(r, L). We therefore have F FKP (r) — b c gF(r), where 



2 _ Jd 3 r[JdLn{r,L)b(r,L)w FKP {r)]- 

'eff — 2 

J d 3 r [J dLn(r,L) w FK p(r)] 



(18) 



This effective bias is potentially more serious than just a shift in the 
overall normalization, since w F kp = 1/[1 + nP(k)] is a function 
of wavenumber: a different F(r) is to be transformed for each k 
value. This means that b g is also a function of k, and the shape 
of the recovered spectrum is systematically altered. However, most 
analyses (including P01) have not taken the FKP mantra to this 
extreme, and have in practice assumed a single value for P(k) at 
all wavenumbers. Therefore, the P01 analysis is not affected by this 
potential difficulty. 

Even so, there is a second difference between our results and 
FKP, which may be seen in Eq. 15. The derived power spectrum 
is an estimate of the true power convolved with a window func- 
tion |G(fc)| 2 , and the correct form for this window differs from the 
corresponding function in the FKP analysis. As shown in P01, this 
convolution significantly changes the shape of the recovered power 
spectrum, and so we are led to ask whether the change in the win- 
dow is important; we show below in Section 3.4 that it is not. 

Finally, although we are thus able to identify potential system- 
atic errors that can arise from application of the FKP procedure to 
luminosity-dependent clustering, this does not address the issue of 
optimality. In fact, the original FKP weight is not optimal in gen- 
eral - nor is the simple bias-corrected multiple of it. A derivation of 
the correct optimal weight (in the usual limit of a narrow fc-space 
window) is presented in Section 2.3. 



2.2 Statistical fluctuations in the convolved 3D power 

We now proceed to analyse the error in our power estimator, as in 
FKP section 2.2. We assume that the Fourier components F(k) 
are Gaussian-distributed, so the variance in our estimate of the 
convolved power (SP(k)SP(k')) = \{F(k)F* (fe')}| 2 (FKP Ap- 
pendix B). On the large-scales of interest this is a good approxi- 
mation in the limit of a compact window function. Following the 
same steps as in FKP section 2.2, our estimate of the mean square 
fluctuations in the recovered convolved power (F(k)F* (k + 5k)) 
is given by 



{F(k)F*(k + 5k)) ~ P(k)Q(5k) + S{5k), 
where 



(19) 
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= / dV / dLn(r,L)w(r,L) 



jkr 



(20) 



Q(k) 
and 

S(k) = ^Jd a rJ dLn(r,L)^^,^. (21) 

This leads to an estimate of the error in the convolved power 
(SP(k)SP(k')) = \P(k)Q(Sk) + S(Sk)\ 2 . (22) 

2.3 Optimal Weighting 

Optimal weights can now be derived for such an analysis following 
a direct extrapolation of FKP section 2.3. 

We assume that the window function is compact in fc-space 
compared with the scales of interest; this will be discussed in more 
detail in Section 5. The mean square fluctuation in our estimate of 
the power is 



Op(fc) EE 



(23) 



P(k) - P(kj\ 

= 772 / d3fc / d 3 k' (SP(k)SP(k')) . (24) 

V k Jv k JV k 

If the shell over which we average has a width that is large 
compared to the effective width of Q(fc), but small compared with 
the variation in P(k), then this double integral reduces to 



(25) 



a 2 P (k)~±-J d 3 k'\P(k)Q(k') + S(k')\ 2 . 

Using the definitions of Q(k) and S(k) from Eqns. 20 & 21, and 
Parseval's theorem, the fractional variance in the power is 



P 2 {k) 



#/*{[/ 



1 



dLn(r,L) 



dLn(r, L)w(r, L) 



(r,L) 



b 2 (r,L) 



P{k) 



(26) 



We wish to find w(r, L) that minimize Eq. 26. Without loss of 
generality, we consider a small variation in the weights w(r,L) — > 
w(r, L)+5(L — L')8w'(r). To keep the equations concise, we set 
w = w(r, L), n = n(r, L), b = b(r, L) and P = P(k). Simi- 
larly, w' ee w(r, L'), n = n(r, L') and b' = b(r, L'). Requiring 
that op (k) be stationary with respect to arbitrary variations Sw' (r ) 
gives that 



Jd 3 r 


(JdLn W ) 2 +(jdL^)'_ 




n' (J dLnw) + 


Sw' 


J d*r 


(J dLnw}' 2 + 




2 



J d 3 rn' (/ dLnw) Sw' 
j d 3 r (J dL nw) 2 
The non-trivial solution to this Equation is 

w(r,L')- ^L')P(k) 



1 + / dLn(r,L)b 2 (r,L)P(k)' 



(27) 



(28) 



which is the principal result of this paper. 

This formula was found by considering initially the case of 
two sets of galaxies with different luminosities, so the integral is 
replaced by a sum over the two subsets. The resulting equations 
were then solved with the aid of MATHEMATICA. The form of the 



2-class solution suggested a conjecture for the general solution with 
a countable number of sets of galaxies, and thus for a set of galaxies 
with a continuous distribution of luminosities. The conjecture was 
readily verified by direct substitution. It is straightforward to see 
that this reduces to the formula of FKP (their equation 2.3.4) for a 
sample of galaxies at a single luminosity. 

Eq. 28 shows that w(r, L) depends not only on the expected 
bias of galaxies of that luminosity b(r,L), but additionally on 
the bias of galaxies of all luminosities at this location through 
J dLn(r, L)b 2 (r, L). This follows because the balance between 
shot noise and cosmic variance, which is at the heart of the deriva- 
tion of the optimal weights, depends on what we learn from all 
galaxies whatever their luminosity. It is also interesting to note that, 
while the contribution of galaxies to the overdensity estimate ap- 
pears to be inversely proportional to their bias (the 1 /b(r, L) fac- 
tor in Eq. 6), optimizing the weight actually means that the net 
contribution of these galaxies is increased - they contain the most 
signal-to-noise because of their strong clustering. 

2.4 Choice of a 

The value of a in Eq. 6 sets the expected number of galaxies for 
a particular survey. In this Section we discuss how this can be set 
given no information other than the survey itself. 

Suppose that we had used an incorrect value of a in Eq. 6, so 
that this Equation became 

F(r) = ^JdL ^± [n 9 (r, L) - (a - /?)n.(r, L)] , (29) 

then the effect of the additional component is to introduce an addi- 
tional term 



a 2 J (2tt 



|G(fc)| 2 



(30) 



to the right hand side of Eq. 14. Turning this argument around, we 
see that not knowing the true value of a is equivalent to adding a 
multiple of the Fourier transformed window function to our power 
estimate. In general, the average weighted galaxy density has to 
be derived from the survey itself, and it is therefore impossible to 
know the true offset between the number of galaxies observed and 
that expected. In this situation, the only sensible thing to do is to 
set 



S<?rfdL$£$n g (r,L) 
f<PrfdL$$ n ,(r,L)' 



(31) 



so that the average weighted overdensity is artificially set to zero. 
This self-normalization forces P(0) = and results in a deficit in 
the estimated power equivalent to subtracting a scaled copy of the 
window function, centered on k = (Peacock & Nicholson 1991). 
In the limit of no window, the effect of this self-normalization 
would be to remove a delta function located at k = such that 
P(0) = 0. In effect, this procedure evades the possibility of noise 
in our estimate of large-scale power owing to the uncertainty in n, 
at the expense of systematic damping of the large-scale signal (cf. 
the approach of Tadros & Efstathiou 1995). 

2.5 A note on the convention adopted 

Instead of allowing for a continuous distribution of galaxy lumi- 
nosities and corresponding biases, an alternative approach would 
have been to consider a countable number of sets of galaxies with 
different luminosities. In this case, Eq. 6 would have become 
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F{r) 



J_ \ - w(r, Li 
N 2-^ b(r, 



[n g (r,Li) -an s (r,Li)] 



(32) 



where n g (r,Li) = ^\ S(r — rj) now gives the density of galaxies 
within subset i, the subset containing those galaxies of luminosity 
Li. The analysis and derivation of optimal weights described in 
this paper follows exactly as for the continuous case, except that 
we obviously need to replace the integral over luminosity with a 
sum over the subsets of galaxies throughout. 



3 APPLICATION TO MOCK CATALOGUES 

Having introduced the necessary formalism, we now illustrate the 
revised FKP method using simple artificial surveys. These artifi- 
cial surveys were designed to approximate the selection function 
and luminosity-dependent bias of the 2dFGRS galaxies analysed 
by POL 



3.1 Selection function 

The selection function is approximately matched to the part- 
complete 2dFGRS survey (Colless et al. 2001) as analysed in P01. 
The redshift distribution used for the galaxies is given by 



cl \ 1-25 

f(z) = z exp 



(0.I3) 



(33) 



and limited to < 2 < 0.25. 

This radial selection function corresponds to the angle- 
averaged selection function of the survey as analyzed in P01 in- 
cluding both the NGP, SGP and random fields. However the se- 
lection function does not exactly match that of P01 as it does not 
include the varying survey depth of the 2dFGRS sample (see Col- 
less et al. 2001 for details). As we simply wish to compare and 
contrast our revised analysis method and the effect of not taking 
into consideration luminosity-dependent bias, this difference is not 
important. The expected number of galaxies in each catalogue was 
set at 140 000, although the actual number varies through Poisson 
sampling (see Section 3.2). 

In the following analysis we use a512 x 512 x 256 grid that 
just covers the survey region: given the survey geometry, the rectan- 
gular nature of this grid yields roughly similar sampling, and there- 
fore Nyquist frequencies, in each Cartesian direction. 

3.2 Creating the mock catalogues 

In this Section we describe how the mock galaxy catalogues, drawn 
from lognormal random fields, were generated (this closely follows 
the method described in Coles & Jones 1991). Lognormal random 
fields were used for convenience because they obey the physical 
limit S(r) > —1 Vr, and approximate the present-day non-linear 
fluctuation field. 

The underlying matter power spectrum was chosen to be a 
ACDM model calculated from the transfer function fitting formu- 
lae of Eisenstein & Hu (1999) coupled with the non-linear fitting 
formulae of Smith et al. (2002). The following cosmological pa- 
rameter values were set: f2 m = 0.3, f2 b /Sl m = 0.15, fU = 0.7, 
h = 0.7, n s — 1 (consistent with the recent WMAP results - 
Spergel et al. 2003). The normalization of the power spectrum (cho- 
sen to be as = 1) is unimportant for our analysis, which focuses 
on the relative change in the recovered power with various weight- 
ing schemes rather than the absolute value. In addition, we will 



use Eq. 2 to determine bias relative to L* galaxies rather than the 
absolute bias, so the power spectrum normalization chosen should 
correspond to that of L» galaxies. 

This power spectrum was determined on the grid described 
in Section 3.1. In order to avoid a sharp cut-off in the power, we 
introduce a smooth turn over at 0.1 times the minimum Nyquist 
frequency, ^Ny, which cuts the input power at 0.25^N y . This does 
not affect any of our conclusions as we are only interested in the 
large-scale power k < 0.15 /iMpc -1 . The input power was in- 
verse Fourier transformed to obtain the correlation function of the 
lognormal field required. The covariance of the Gaussian field, 
required to generate the lognormal field, is then obtained from 
£g(t") = hi[l + £ln(7")], and this was converted back to the power. 
A Gaussian density field Sg was then generated on the grid with 
this power spectrum, and the corresponding lognormal field was 
calculated, given by <5ln = exp((5G — <?g/2) — L where a G is the 
variance of the Gaussian density field. 

The lognormal density field was then used to create a cata- 
logue that matches the luminosity-dependent clustering of the 2dF- 
GRS sample. To facilitate this process we made a number of simpli- 
fications. First, we used the same grid both to create the catalogue 
and to estimate the power spectrum. There was therefore no need 
to create a full catalogue; instead, we can determine a catalogue al- 
ready sampled on the grid. The additive nature of the Poisson dis- 
tribution meant that the total number of galaxies at each grid point 
could be calculated by drawing a random Poisson variable with 
mean given by the selection function multiplied by the lognormal 
field and the mean expected bias at that grid point. These galaxies 
then need to be assigned luminosities. As we simply wish to use 
the mock catalogues to test the analysis method, we choose to only 
model the net effect, which is that the effective bias at a survey 
location depends on the distance from the observer. We therefore 
assume that the galaxies at each grid point all have the same lumi- 
nosity, given by the average luminosity of the 2dFGRS sample at 
that distance, and that they have the same average expected bias. 
This approach can be interpreted as a special case where we are 
modeling power spectrum evolution along the line-of-sight rather 
than luminosity-dependent clustering. 

We now describe how we calculated the biased lognormal den- 
sity field. For the 2dFGRS sample of P01, the mean expected bias 
as a function of redshift is well fit by the formula 



(b(r)) 



dL n(r, L)b(r, L) = 0.85 + Qz 1 



(34) 



where z is the redshift corresponding to a given radius. 

There are a number of ways that one could apply this formula 
to create a biased density field. Perhaps the most rigorous method 
would be to create a number of lognormal fields with the same 
phases and different amplitudes, each corrected from the desired 
clustering strength as detailed above. We should then use the field 
corresponding to the desired power amplitude at each grid point. 
However, this would have been computationally very expensive. In 
the linear regime of interest, the fields are all of small amplitude 
and the correction between Gaussian and lognormal fields is very 
small. We therefore created a single lognormal field corresponding 
to the largest mean expected bias in the 2dFGRS sample b(L) ~ 
1 Ab(L* ) , and multiplied this field by the expected mean bias given 
by Eq. 34 divided by 1.4. By choosing to create the field for the 
largest mean bias we ensure that Jln^) > —1 Vr. Alternatives 
would have been to bias either the Gaussian field Sg — > 6(L)<$g» 
or to set (1 + <5ln) — ► (1 + (Jln) 6 ' 1 "'. This choice is not signif- 
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icant, and all options result in fields with 5ln(^) > — 1 Vr, 
approximately the correct varying clustering strength. 



and 



3.3 Analysis of mock catalogues 

In order to test our derivation of optimal weights, we consider three 
weighting schemes. First we consider the weights derived to be op- 
timal for a narrow fc-space window 



Wi = 



b 2 (r,L)P(k) 



1 + J dLn{r,L)b 2 {r,L)P(k)' 



(35) 



The second weighting scheme that we consider is designed to 
highlight how luminosity-dependent bias affects the results of P01, 
and is given by 



u> 2 = 



b(r,L)P(k) 



1 + / dLn(r,L)b 2 {r,L)P(k)' 



(36) 



The b(r,L) term in the numerator cancels the l/b(r,L) factor 
in Eq. 6 applied to each galaxy to correct the differing clustering 
strengths. Using these weights is therefore equivalent to the original 
FKP scheme, except that there is an additional b 2 (r, L) in the de- 
nominator. This form was chosen rather than the exact FKP weight 
because this additional term is simply equivalent to varying P(k) 
as a function of r, and keeping the denominator fixed for all three 
weights enables the effect of the bias term in the numerator to be 
more easily understood. 

In addition we consider weights that apply no distinction be- 
tween galaxies with different expected biases 



w s = 



P(k) 



1 + J dLn(r,L)b 2 {r,L)P(k)' 



(37) 



For the analysis presented in the next Section, we assumed a 
fixed value of P(k) = 5000 ft 3 Mpc -3 for all of the weighting 
schemes. The squared window functions |G(fc)| 2 were calculated 
as in Eq. 15, and were spherically averaged as for the power esti- 
mate (Eq. 17). The data were then fitted with a smooth curve cal- 
culated using a Spline3 algorithm (Press et al. 1992). This curve 
was found to be a better fit to the shape of the recovered win- 
dow at k > 0.1 ft Mpc" 1 than the fitting formula of POL For 
k > 0.6 ft Mpc" 1 , we extrapolate the window function as fc" 4 . 
This extrapolation does not significantly affect the shape of the re- 
covered convolved power, although it does have a slight effect on 
the normalization. 

As we have shown in Eqns. 6 & 15, although the galaxies were 
multiplied by the weight divided by the expected bias, the window 
function is only dependent on the weight. In an FKP analysis, such 
as P01, the weight assumed for the window function is the same 
as the multiplicative factor in Eq. 6, w(r,L)/b(r,L). For exam- 
ple, if weights u>2 were applied to the galaxies in an FKP analysis 
of a catalogue with luminosity dependent clustering, weights u>3 
would be assumed for the window function. Similarly, if weights 
wi were applied to the galaxies, weights W2 would be assumed for 
the window function. Because we fix P(k) in these weights as in 
P01, then the change in the shape of the recovered power spectrum 
is only dependent on this change in the window (see Section 2.1). 
The importance of the changing window on the shape of the power 
spectrum is considered in the next Section where we present the re- 
covered power spectra from our mock catalogues calculated using 
weights wi, W2, and W3. 

Because we create the mock catalogues and power estimates 
on the same grid, we know the selection function at each grid point 




0.02 0.05 0.1 

wave number k / h Mpc 1 

Figure 1. Top panel: the average recovered power spectrum from 1000 
mock catalogues calculated within a selection function designed to mimic 
the 2dFGRS sample analysed by P01 (solid circles). These data were 
weighted using weights wi given by Eq. 35. For comparison we also plot 
the input power spectrum (solid line) and the power spectrum convolved 
with the fit to the spherically averaged Fourier window function (dashed 
line). In the lower panel, these data are compared with the models in more 
detail, and we plot the ratio of the average recovered power to the con- 
volved model (solid circles with la errors). The open diamonds show the 
ratio of the data to the expected convolved power including correction for 
the self-normalization induced by matching the average galaxy density (see 
Section 2.4 for details). 



exactly. We can therefore easily subtract the mean galaxy density 
without having to create a synthetic catalogue. In order to deter- 
mine the shot-noise level and the power normalization we require 
two integrals, given by Eqns. 16 & 7 respectively. As we are assum- 
ing that the distribution of galaxy luminosities is a delta function at 
each grid point, and we know the selection function at these points, 
then it is relatively straightforward to calculate these two quantities 
by numerically integrating over the grid used. In the following we 
apply the method described in Section 2.1 to estimate the power 
at 100 points evenly spaced in < k < 0.1287T ftMpc" 1 . This 
sampling is matched to that of POL 

3.4 Results 

We have created 1000 lognormal catalogues as described in Sec- 
tion 3.3, and have analysed these catalogues using the three dif- 
ferent weighting schemes given in Section 3.3. In this paper we 
only consider the recovered power for 0.02 < k < 0.15 ftMpc" 1 . 
This matches the region considered in P01 and avoids the non- 
linear regime k > 0.15 ft Mpc -1 , where the Gaussian behaviour 
of the lognormal catalogues is expected to break down. In fact we 
do find a turn-up of the data for k > 0.15 ft Mpc -1 , consistent 
with the non-linear behaviour of the lognormal model. The top 
panel of Fig. 1 shows the average recovered power spectrum for 
0.02 < k < 0.15 ft Mpc" 1 , calculated from the mock catalogues 
weighted with weights w\ (solid circles) compared to the power ex- 
pected from fitting to the radially averaged Fourier window func- 
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wave number k / h Mpc 



Figure 2. The average recovered power spectrum from 1000 mock cata- 
logues, weighted as given by Eqns. 35 to 37, ratioed to the input power 
(symbols). These data are compared with the expected power calculated by 
convolving a fit to the spherically averaged Fourier window function with 
the input power (solid lines). The dotted lines include the correction for 
the self-normalization induced by matching the average galaxy density (see 
Section 2.4 for details). 



tion for weighting scheme w 1 (dashed line); the solid line is the 
input power spectrum (not convolved with the window). 

In order to examine the fit in detail and compare the average 
recovered power from the three weighting schemes, we ratio this 
power to the input power spectrum in Fig. 2. Here we see that the 
increasing factor of bias in the weights u>3 — ► t«i increases the ef- 
fective size of the window in real space, and therefore reduces the 
effect of the window in Fourier space: weighting scheme wi pro- 
duces the largest real-space volume as it increases the importance 
of the rare high-luminosity galaxies at high redshift the most. 

The recovered diagonal elements of the covariance matrix cal- 
culated using the three weighting schemes were very similar in 
amplitude, so the number of realizations analysed was insufficient 
to distinguish the relative errors in the recovered power produced 
by the different the weighting schemes using this statistic. How- 
ever, the smaller fc-space window function created using weights 
«)i does mean that this scheme is better at finding features in the 
power spectrum compared with the other weighting schemes. 

The difference between the convolved power for u>2 and 
W3 corresponds to the small offset induced by not correcting for 
luminosity-dependent clustering. This is illustrated in Fig. 3, where 
the open circles show the ratio of the average power spectrum re- 
covered from the mock catalogues using the original FKP method 
(calculated using weights u>2, Eq. 36) with the expected power (the 
input power spectrum convolved with the window function corre- 
sponding to weight wi). The solid line show the expected ratio of 
1. The errors on the data are dominated by cosmic variance and 
are given by the square root of the diagonal elements of the co- 
variance matrix. The solid squares with one sigma errors show the 
ratio of the the same recovered power spectrum with the matter 
power spectrum convolved with the window relative to weights W3, 
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wave number k / h Mpc 

Figure 3. The average recovered power from the mock catalogues, calcu- 
lated using weight W2 given by Eq. 36 ratioed to the expected power (open 
circles with la errors). The solid line shows the expected ratio of 1. The 
solid squares with lcr errors show the same average recovered data, com- 
pared to the model convolved with the window calculated from weights 
W3, those assumed in the standard FKP analysis. The dashed line shows the 
expected ratio in this case. The errors on the data are dominated by cos- 
mic variance: ratioing the two recovered average power spectra for the two 
weights gives the open squares. We see that not allowing for luminosity- 
dependent clustering, equivalent to assuming the wrong window, changes 
the power spectrum normalization and induces a slight change of slope. 

the window assumed without correcting for luminosity-dependent 
clustering. The dashed line shows the expected ratio. In the next 
section we consider how this offset affects the 2dFGRS analysis of 
POL 



4 APPLICATION TO 2DFGRS ANALYSIS OF P01 

In the analysis of P01, no correction was made for the varying bias 
caused by the range in galaxy luminosities within the sample. As 
described in Section 2.1, this causes the incorrect window func- 
tion to be assigned to the analysis. In the top panel of Fig. 4 we 
compare the recovered 2dFGRS power spectrum data with a model 
power spectrum convolved with both the window function calcu- 
lated using the FKP method, and that calculated after correction 
for luminosity dependent bias (calculated with weights given by 
b(L)w(r), see Section 2.1). The corrected window function has a 
smaller effect on the power compared with the original because the 
extra bias weighting increases the effective size of the survey. The 
models in this figure vary slightly from those presented and used by 
P01, as we have fitted the spherically averaged window functions 
using Spline3 fits as described in Section 3.3. Changing how each 
spherically averaged window was fitted has very little effect on the 
shape of the recovered convolved power spectra, but it can affect 
the normalization (~ 10%), depending on the exact nature of the 
fits. This will be discussed further in a subsequent paper. 

The effect of not assuming the correct window when fitting 
the recovered power spectrum slope and amplitude of the 2dFGRS 
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wave number k / h Mpc 



Figure 4. Top panel: the 2dFGRS power spectrum data of P01 (solid points) 
with errors given by root of the diagonal elements of the covariance ma- 
trix. These data are compared with the concordance model (f! m = 0.3, 
f2b/^m = 0.15, Qa = 0.7, h = 0.7, n s = 1) power spectrum con- 
volved with the window function used in P01 (lower thin solid line), and 
with the corrected window function allowing for luminosity-dependent bias 
(upper thick solid line). In this plot, the 2dFGRS data have been fitted to the 
upper thick solid line. The dotted lines include the correction for the self- 
normalization of the galaxy number density. All of the data are ratioed to the 
unconvolved concordance model power spectrum. The difference between 
the two convolved models is negligible compared to the errors in the 2dF- 
GRS power. The ratio between the two convolved models gives the effect 
of not correcting for luminosity-dependent bias, and is plotted in the lower 
panel of this Figure (solid line). The change in shape induced is closely 
matched by a change in the spectral index of the fitted model - the dot- 
dash line shows the ratio of a convolved power spectrum with n = 1 and 
the window function allowing for luminosity dependent bias divided by a 
power spectrum with n = 0.995 and a window function as assumed in a 
FKP analysis: the shape of the power spectrum recovered in a universe with 
n = 1 would be fitted by a power spectrum with n = 0.995 convolved 
with the original FKP window. 

analysis of P01 is quantified in the lower panel of Fig. 4. The solid 
line is the ratio of the expected power spectrum from a sample of 
galaxies with luminosity-dependent bias within a concordance uni- 
verse ratioed to a theory power spectrum with the same cosmolog- 
ical parameters but with n = 0.995 instead of n = 1, convolved 
with window assumed in POL This small shift in n is sufficient to 
correct the change in shape caused by assuming the wrong window, 
and is approximately equivalent to a change A£l m h ~ 0.0014. 
This change is obviously very small compared with the statistical 
errors on the recovered parameters. There is also a small system- 
atic error on the power spectrum amplitude of < 2%, reflecting the 
change in the normalization of the spherically averaged windows. 
The solid line in the lower panel of Fig. 4 is comparable with the 
dashed line in Fig. 3, which shows the same effect for the mock 
catalogues. The difference between the two lines is caused by the 
change in the denominator of Eqns. 36 & 37, compared with the 
original FKP weightings. The extra factor of b (r, L) in Eqns. 36 
& 37 decreases the importance of the high redshift data, so the fac- 
tor of b(r, L) on the numerator has more effect. 



Using the revised window when fitting to the published 2dF- 
GRS power spectrum data of P01 changes the recovered Sl m h and 
^b/fim values very slightly (to higher Q, m h and lower baryon 
fraction), but gives best-fit parameters Q m h = 0.20 ± 0.03 and 
Qb/Qm — 0. 15±0. 07 (68% confidence interval, assuming n 3 = 1 
& h = 0.7 ± 0.07), that are the same as those reported by P01 at 
this significance. 



5 DISCUSSION 

We have presented a method for the Fourier analysis of galaxy red- 
shift surveys that allows for luminosity-dependent clustering. Al- 
though this generalization of the FKP analysis method is specif- 
ically designed to allow for luminosity-dependent clustering, the 
derivation is actually more general and can be applied to any mixed 
distribution of galaxies with differing clustering strength. Conse- 
quently, it can cope with type-dependent clustering, power spec- 
trum evolution and bias evolution, and should therefore be of use 
in the analysis of future deeper redshift surveys. The method re- 
quires knowledge of the relative change in the bias as a function of 
galaxy properties, although to recover the matter power spectrum 
amplitude, the absolute bias is required. 

The optimal weights presented in this paper (Eq. 28) dif- 
fer from those of Yamamoto (2003), ignoring the correction for 
redshift-space distortions (equation 29 of Yamamoto 2003). This 
results from the fact that we included the effect of luminosity- 
dependent clustering at the inception of our optimal weight deriva- 
tion and minimized the error in the underlying matter power spec- 
trum. Yamamoto (2003) instead minimized the error in the galaxy 
power spectrum (biased power in the language of this paper). Con- 
sequently in our weighting scheme, luminous galaxies are given 
a higher weight than less luminous galaxies: they contribute more 
signal-to-noise to the measure of the underlying fluctuations than 
less luminous galaxies. However, this is not true if we wished to 
optimize for the biased power as shown by Yamamoto (2003). The 
weighting scheme that should be adopted depends on the statistic 
to be measured: if we wish to measure the underlying matter power 
spectrum, then the weights derived in Section 2.3 are more appli- 
cable. 

The derivation of optimal weights was performed in the limit 
of a negligible window function. On scales comparable to the size 
of the window, the optimal weights will change, and will depend 
on the survey geometry such that the weights no longer have radial 
symmetry (e.g. Hamilton 1997). Although the three radially sym- 
metric weighting schemes considered in Section 3.3 differed by a 
factor of ~ 2 over the redshift range of interest, they resulted in 
remarkably similar diagonal errors in the power spectrum. If we 
constrain the weights to be radially symmetric then small devia- 
tions away from the derived optimal distribution do not appear to 
have a significant effect on the recovered errors. 

For P01, the error induced by not allowing for the effect of 
luminosity-dependent clustering was shown to be negligible in Sec- 
tion 4. One of the factors that contributed to this was that, in the 
weights applied, P(k) was fixed at P(k) = 5000 h 3 Mpc~ 3 . This 
was not the case in many other Fourier analyses of galaxy redshift 
surveys (e.g. Tegmark, Hamilton & Xu 2002) which use a weight 
that varies as a function of k. Varying the estimated power used in 
Eq. 28 changes the normalization N (Eq. 7), which works in con- 
junction with the change in shape of the spherically averaged win- 
dow to alter the recovered power. As we have shown in this paper, 
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it is possible to correct these effects by using the correct window 
function at each fc-value. 

Redshift space distortions caused by peculiar velocities will 
also affect the shape of the recovered power spectrum. In the spirit 
of generalizing FKP, such effects are not included here. How- 
ever P01 used mock catalogues drawn from the Hubble volume 
simulation to show that these were not significant on the scales 
(0.02 < k < 0.15 h Mpc -1 ) considered in this work. This issue 
will be explored further in a later paper. 
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